- E < p. ˆ p q ˆ E = q ˆ = 1 - p ˆ = sample proportion of x failures in a sample size of n. where. x n sample proportion. population proportion

Similar documents
Overview. p 2. Chapter 9. Pooled Estimate of. q = 1 p. Notation for Two Proportions. Inferences about Two Proportions. Assumptions

Common Large/Small Sample Tests 1/55

MOST PEOPLE WOULD RATHER LIVE WITH A PROBLEM THEY CAN'T SOLVE, THAN ACCEPT A SOLUTION THEY CAN'T UNDERSTAND.

7-1. Chapter 4. Part I. Sampling Distributions and Confidence Intervals

Section 9.2. Tests About a Population Proportion 12/17/2014. Carrying Out a Significance Test H A N T. Parameters & Hypothesis

Chapter 8: Estimating with Confidence

Inferential Statistics. Inference Process. Inferential Statistics and Probability a Holistic Approach. Inference Process.

Lecture 5. Materials Covered: Chapter 6 Suggested Exercises: 6.7, 6.9, 6.17, 6.20, 6.21, 6.41, 6.49, 6.52, 6.53, 6.62, 6.63.

HYPOTHESIS TESTS FOR ONE POPULATION MEAN WORKSHEET MTH 1210, FALL 2018

(7 One- and Two-Sample Estimation Problem )

Class 23. Daniel B. Rowe, Ph.D. Department of Mathematics, Statistics, and Computer Science. Marquette University MATH 1700

Math 140 Introductory Statistics

LESSON 20: HYPOTHESIS TESTING

BIOS 4110: Introduction to Biostatistics. Breheny. Lab #9

Statistical Inference About Means and Proportions With Two Populations

MATH/STAT 352: Lecture 15

Chapter 13, Part A Analysis of Variance and Experimental Design

Working with Two Populations. Comparing Two Means

Final Examination Solutions 17/6/2010

Introduction to Econometrics (3 rd Updated Edition) Solutions to Odd- Numbered End- of- Chapter Exercises: Chapter 3

Mathacle. PSet Stats, Concepts In Statistics Level Number Name: Date:

Statistical Intervals for a Single Sample

Continuous Data that can take on any real number (time/length) based on sample data. Categorical data can only be named or categorised

Sample Size Determination (Two or More Samples)

Interval Estimation (Confidence Interval = C.I.): An interval estimate of some population parameter is an interval of the form (, ),

Properties and Hypothesis Testing

Statistics 300: Elementary Statistics

This chapter focuses on two experimental designs that are crucial to comparative studies: (1) independent samples and (2) matched pair samples.

Mathacle. PSet Stats, Concepts In Statistics Level Number Name: Date: Confidence Interval Guesswork with Confidence

z is the upper tail critical value from the normal distribution

Mathematical Notation Math Introduction to Applied Statistics

ENGI 4421 Confidence Intervals (Two Samples) Page 12-01

1 Inferential Methods for Correlation and Regression Analysis

Econ 325 Notes on Point Estimator and Confidence Interval 1 By Hiro Kasahara

PSYCHOLOGICAL RESEARCH (PYC 304-C) Lecture 9

Chapter 4 Tests of Hypothesis

Statistics. Chapter 10 Two-Sample Tests. Copyright 2013 Pearson Education, Inc. publishing as Prentice Hall. Chap 10-1

Frequentist Inference

Chapter 22. Comparing Two Proportions. Copyright 2010, 2007, 2004 Pearson Education, Inc.

STA Learning Objectives. Population Proportions. Module 10 Comparing Two Proportions. Upon completing this module, you should be able to:

STAT431 Review. X = n. n )

Chapter 13: Tests of Hypothesis Section 13.1 Introduction

2 1. The r.s., of size n2, from population 2 will be. 2 and 2. 2) The two populations are independent. This implies that all of the n1 n2

Statistics 20: Final Exam Solutions Summer Session 2007

Recall the study where we estimated the difference between mean systolic blood pressure levels of users of oral contraceptives and non-users, x - y.

1036: Probability & Statistics

Chapter 1 (Definitions)

Tests of Hypotheses Based on a Single Sample (Devore Chapter Eight)

Chapter 23: Inferences About Means

Stat 200 -Testing Summary Page 1

If, for instance, we were required to test whether the population mean μ could be equal to a certain value μ

October 25, 2018 BIM 105 Probability and Statistics for Biomedical Engineers 1

Math 152. Rumbos Fall Solutions to Review Problems for Exam #2. Number of Heads Frequency

Successful HE applicants. Information sheet A Number of applicants. Gender Applicants Accepts Applicants Accepts. Age. Domicile

Estimating the Population Mean - when a sample average is calculated we can create an interval centered on this average

A quick activity - Central Limit Theorem and Proportions. Lecture 21: Testing Proportions. Results from the GSS. Statistics and the General Population

Chapter 22. Comparing Two Proportions. Copyright 2010 Pearson Education, Inc.

Topic 9: Sampling Distributions of Estimators

FACULTY OF MATHEMATICAL STUDIES MATHEMATICS FOR PART I ENGINEERING. Lectures

MA238 Assignment 4 Solutions (part a)

Announcements. Unit 5: Inference for Categorical Data Lecture 1: Inference for a single proportion

Chapter 22: What is a Test of Significance?

MTH 345 Exam 2 Review Questions Fall Based on a Bellowes survey of adults, there is a 0.48 probability that a randomly selected adult uses a

University of California, Los Angeles Department of Statistics. Hypothesis testing

A Statistical hypothesis is a conjecture about a population parameter. This conjecture may or may not be true. The null hypothesis, symbolized by H

Power and Type II Error

Chapter 5: Hypothesis testing

Class 27. Daniel B. Rowe, Ph.D. Department of Mathematics, Statistics, and Computer Science. Marquette University MATH 1700

April 18, 2017 CONFIDENCE INTERVALS AND HYPOTHESIS TESTING, UNDERGRADUATE MATH 526 STYLE

Chapter 20. Comparing Two Proportions. BPS - 5th Ed. Chapter 20 1

AP Statistics Review Ch. 8

Sampling Distributions, Z-Tests, Power

Statistical Inference (Chapter 10) Statistical inference = learn about a population based on the information provided by a sample.

Data Analysis and Statistical Methods Statistics 651

STATISTICAL INFERENCE

Agreement of CI and HT. Lecture 13 - Tests of Proportions. Example - Waiting Times

Lecture 5: Parametric Hypothesis Testing: Comparing Means. GENOME 560, Spring 2016 Doug Fowler, GS

Estimation of a population proportion March 23,

Computing Confidence Intervals for Sample Data

Confidence Interval for one population mean or one population proportion, continued. 1. Sample size estimation based on the large sample C.I.

Important Formulas. Expectation: E (X) = Σ [X P(X)] = n p q σ = n p q. P(X) = n! X1! X 2! X 3! X k! p X. Chapter 6 The Normal Distribution.

1 Constructing and Interpreting a Confidence Interval

Confidence Intervals QMET103

Statistics 511 Additional Materials

Topic 9: Sampling Distributions of Estimators

MIT : Quantitative Reasoning and Statistical Methods for Planning I

This is an introductory course in Analysis of Variance and Design of Experiments.

UNIVERSITY OF TORONTO Faculty of Arts and Science APRIL/MAY 2009 EXAMINATIONS ECO220Y1Y PART 1 OF 2 SOLUTIONS

Sample questions. 8. Let X denote a continuous random variable with probability density function f(x) = 4x 3 /15 for

Statistical inference: example 1. Inferential Statistics

CH19 Confidence Intervals for Proportions. Confidence intervals Construct confidence intervals for population proportions

GG313 GEOLOGICAL DATA ANALYSIS

Describing the Relation between Two Variables

The Hong Kong University of Science & Technology ISOM551 Introductory Statistics for Business Assignment 3 Suggested Solution

MATH 320: Probability and Statistics 9. Estimation and Testing of Parameters. Readings: Pruim, Chapter 4

Direction: This test is worth 150 points. You are required to complete this test within 55 minutes.

Big Picture. 5. Data, Estimates, and Models: quantifying the accuracy of estimates.

Error & Uncertainty. Error. More on errors. Uncertainty. Page # The error is the difference between a TRUE value, x, and a MEASURED value, x i :

Samples from Normal Populations with Known Variances

A statistical method to determine sample size to estimate characteristic value of soil parameters

Transcription:

1 Chapter 7 ad 8 Review for Exam Chapter 7 Estimates ad Sample Sizes 2 Defiitio Cofidece Iterval (or Iterval Estimate) a rage (or a iterval) of values used to estimate the true value of the populatio parameter Lower # < populatio parameter < Upper # As a example 0.476 < p < 0.544 3

4 Cofidece Iterval for Populatio Proportio pˆ - E < p E = where z a / 2 < p ˆ + E ˆ p q ˆ Notatio for Proportios p = populatio proportio p ˆ = x sample proportio (proouced p-hat ) of x successes i a sample of size q ˆ = 1 - p ˆ = sample proportio of x failures i a sample size of 5 Roud-Off Rule for Cofidece Iterval Estimates of p Roud the cofidece iterval limits to three sigificat digits 6

7 Procedure for Costructig a Cofidece Iterval for p 1. Verify that the required assumptios are satisfied. (The sample is a simple radom sample, the coditios for the biomial distributio are satisfied, ad the ormal distributio ca be used to approximate the distributio of sample proportios because p 5, ad q 5 are both satisfied). 2. Refer to Table A-2 ad fid the critical value z a/2 that correspods to the desired cofidece level. 3. Evaluate the margi of error E = p q ˆˆ Procedure for Costructig a Cofidece Iterval for p 4. Usig the calculated margi of error, E ad the value of the sample proportio, p, ˆ, fid the values of p ˆ E ad p ˆ + E.. Substitute those values i the geeral format for the cofidece iterval: p ˆ E < p < p ˆ + E 5. Roud the resultig cofidece iterval limits to three sigificat digits. 8 Example: I the Chapter Problem, we oted that 829 adult Miesotas were surveyed, ad 51% of them are opposed to the use of the photo-cop for issuig traffic tickets. Use these survey results. Fid the 95% cofidece iterval estimate of the populatio proportio p. 9

10 Example: I the Chapter Problem, we oted that 829 adult Miesotas were surveyed, ad 51% of them are opposed to the use of the photo-cop for issuig traffic tickets. Use these survey results. First, we check for assumptios. We ote that p = 422.79 5, ad q = 406.21 5. ˆ ˆ Next, we calculate the margi of error. We have foud that p = 0.51, q = 1 0.51 = 0.49, z a/ 2 = 1.96, ad = 829. ˆ E = 1.96 E = 0.03403 ˆ (0.51)(0.49) 829 Example: I the Chapter Problem, we oted that 829 adult Miesotas were surveyed, ad 51% of them are opposed to the use of the photo-cop for issuig traffic tickets. Use these survey results. Fid the 95% cofidece iterval for the populatio proportio p. We substitute our values from Part a to obtai: 0.51 0.03403 < p < 0.51 + 0.03403, 0.476 < p < 0.544 11 Example: I the Chapter Problem, we oted that 829 adult Miesotas were surveyed, ad 51% of them are opposed to the use of the photo-cop for issuig traffic tickets. Use these survey results. Based o the results, ca we safely coclude that the majority of adult Miesotas oppose use of the photo-cop? Based o the survey results, we are 95% cofidet that the limits of 47.6% ad 54.4% cotai the true percetage of adult Miesotas opposed to the photo-cop. The percetage of opposed adult Miesotas is likely to be ay value betwee 47.6% ad 54.4%. However, a majority requires a percetage greater tha 50%, so we caot safely coclude that the majority is opposed (because the etire cofidece iterval is ot greater tha 50%). 12

13 Estimatig a Populatio Mea: s Not Kow Cofidece Iterval for the Estimate of m Based o a Ukow s ad a Small Simple Radom Sample from a Normally Distributed Populatio x - E < µ < x + E where E = t s a/2 t a/2 foud i Table A-3 14 Table A-3 t Distributio Degrees of freedom.005 (oe tail).01 (two tails).01 (oe tail).02 (two tails).025 (oe tail).05 (two tails).05 (oe tail).10 (two tails).10 (oe tail).20 (two tails).25 (oe tail).50 (two tails) 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 Large (z) 63.657 9.925 5.841 4.604 4.032 3.707 3.500 3.355 3.250 3.169 3.106 3.054 3.012 2.977 2.947 2.921 2.898 2.878 2.861 2.845 2.831 2.819 2.807 2.797 2.787 2.779 2.771 2.763 2.756 2.575 31.821 6.965 4.541 3.747 3.365 3.143 2.998 2.896 2.821 2.764 2.718 2.681 2.650 2.625 2.602 2.584 2.567 2.552 2.540 2.528 2.518 2.508 2.500 2.492 2.485 2.479 2.473 2.467 2.462 2.327 12.706 4.303 3.182 2.776 2.571 2.447 2.365 2.306 2.262 2.228 2.201 2.179 2.160 2.145 2.132 2.120 2.110 2.101 2.093 2.086 2.080 2.074 2.069 2.064 2.060 2.056 2.052 2.048 2.045 1.960 6.314 2.920 2.353 2.132 2.015 1.943 1.895 1.860 1.833 1.812 1.796 1.782 1.771 1.761 1.753 1.746 1.740 1.734 1.729 1.725 1.721 1.717 1.714 1.711 1.708 1.706 1.703 1.701 1.699 1.645 3.078 1.886 1.638 1.533 1.476 1.440 1.415 1.397 1.383 1.372 1.363 1.356 1.350 1.345 1.341 1.337 1.333 1.330 1.328 1.325 1.323 1.321 1.320 1.318 1.316 1.315 1.314 1.313 1.311 1.282 1.000.816.765.741.727.718.711.706.703.700.697.696.694.692.691.690.689.688.688.687.686.686.685.685.684.684.684.683.683.675 15

16 Example: A study of 12 Dodge Vipers ivolved i collisios resulted i repairs averagig $26,227 ad a stadard deviatio of $15,873. Fid the 95% iterval estimate of m, the mea repair cost for all Dodge Vipers ivolved i collisios. (The 12 cars distributio appears to be bell-shaped.) x = 26,227 s = 15,873 a = 0.05 a/2 = 0.025 t a /2 = 2.201 E = t a / 2 s = (2.201)(15,873) = 10,085.3 12 x - E < µ < x + E 26,227-10,085.3 < µ < 26,227 + 10,085.3 $16,141.7 < µ < $36,312.3 We are 95% cofidet that this iterval cotais the average cost of repairig a Dodge Viper. Ed of 7-2 ad 7-3 Determiig Sample Size Required to Estimate p ad m 17 Sample Size for Estimatig Proportio p Whe a estimate of p is kow: = ˆ z a/2 E 2 ˆ ˆ ( ) 2 p q ˆ Formula 7-2 Whe o estimate of p is kow: ( z ) 2 0.25 Formula 7-3 = a/2 E 2 18

19 Example: Example:We wat to determie, with a margi of error of four percetage poits, the curret percetage of U.S. households usig e-mail. Assumig that we wat 90% cofidece i our results, how may households must we survey? A 1997 study idicates 16.9% of U.S. households used e-mail. = [z a /2 ] 2 p q E 2 ˆˆ = [1.645] 2 (0.169)(0.831) 0.04 2 = 237.51965 = 238 households To be 90% cofidet that our sample percetage is withi four percetage poits of the true percetage for all households, we should radomly select ad survey 238 households. Example: Example:We wat to determie, with a margi of error of four percetage poits, the curret percetage of U.S. households usig e-mail. Assumig that we wat 90% cofidece i our results, how may households must we survey? There is o prior iformatio suggestig a possible value for the sample percetage. = [z a /2 ] 2 (0.25) E = (1.645) 2 (0.25) 2 0.04 2 = 422.81641 = 423 households With o prior iformatio, we eed a larger sample to achieve the same results with 90% cofidece ad a error of o more tha 4%. 20 Sample Size for Estimatig Mea m E = z s a/2 (solve for by algebra) = z a/2 E s 2 Formula 7-5 z a /2 /2 = critical z score based o the desired degree of cofidece E = desired margi of error s = populatio stadard deviatio 21

22 Example: If we wat to estimate the mea weight of plastic discarded by households i oe week, how may households must be radomly selected to be 99% cofidet that the sample mea is withi 0.25 lb of the true populatio mea? (A previous study idicates the stadard deviatio is 1.065 lb.) a = 0.01 z a/2 = 2.575 E = 0.25 s = 1.065 2 2 = z a/2 s = (2.575)(1.065) E 0.25 = 120.3 = 121 households We would eed to radomly select 121 households ad obtai the average weight of plastic discarded i oe week. We would be 99% cofidet that this mea is withi 1/4 lb of the populatio mea. Chapter 8 Hypothesis Testig 23 Claim: Usig math symbols H 0 : Must cotai equality H 1 : Will cotai,, <, > 24

25 Test Statistic The test statistic is a value computed from the sample data, ad it is used i makig the decisio about the rejectio of the ull hypothesis. /\ z = p - p pq Test statistic for proportios Test Statistic The test statistic is a value computed from the sample data, ad it is used i makig the decisio about the rejectio of the ull hypothesis. t = x - µ x s Test statistic for mea 26 Test Statistic The test statistic is a value computed from the sample data, ad it is used i makig the decisio about the rejectio of the ull hypothesis. c 2 = ( 1)s2 s 2 Test statistic for stadard deviatio 27

28 Critical Regio Set of all values of the test statistic that would cause a rejectio of the ull hypothesis Critical Regios Critical Value Ay value that separates the critical regio (where we reject the ull hypothesis) from the values of the test statistic that do ot lead to a rejectio of the ull hypothesis Reject H 0 Fail to reject H 0 Critical Value ( z score ) 29 Two-tailed, Right-tailed, tailed, Left-tailed tailed Tests The tails i a distributio are the extreme regios bouded by critical values. 30

31 Decisio Criterio Traditioal method: Reject H 0 if the test statistic falls withi the critical regio. Fail to reject H 0 if the test statistic does ot fall withi the critical regio. Wordig of Fial Coclusio Figure 8-7 32 Comprehesive Hypothesis Test 33

34 Example: It was foud that 821 crashes of midsize cars equipped with air bags, 46 of the crashes resulted i hospitalizatio of the drivers. Usig the 0.01 sigificace level, test the claim that the air bag hospitalizatio is lower tha the 7.8% rate for cars with automatic tic safety belts. Claim: p < 0.078 p = 46 / 821 = 0.0560 reject H 0 H 0 : p = 0.078 H 1 : p < 0.078 a = 0.01 p = 0.056 z = - 2.35 p - p 0.056-0.078 pq (0.078 )(0.922) 821 z = =» - 2.35 z = - 2.33 p = 0.078 There is sufficiet evidece to support claim that the air bag hospitalizatio rate is lower tha the 7.8% rate for automatic safety belts. 8-5 Testig a Claim about a Mea: s Not Kow 35 Example: Seve axial load scores are listed below. At the 0.01 level of sigificace, test the claim that this sample comes from a populatio with a mea that is greater tha 165 lbs. 270 273 258 204 254 228 282 = 7 df = 6 x = 252.7 lb s = 27.6 lb Claim: Claim: µ > 165 lb H 0 : µ = 165 lb H 1 : µ > 165 lb (right tailed test) 36

37 a = 0.01 0.01 165 0 t = 3.143 252.7 t = 8.407 x - µ t = x 252.7-165 s = = 8.407 27.6 7 Reject H o Example: Seve axial load scores are listed below. At the 0.01 level of sigificace, test the claim that this sample comes from a populatio with a mea that is greater tha 165 lbs. 270 273 258 204 254 228 282 Fial coclusio: There is sufficiet evidece to support the claim that the sample comes from a populatio with a mea greater tha 165 lbs. Reject Claim: µ > 165 lb H 0 : µ = 165 lb H 1 : µ > 165 lb (right tailed test) 38 8-6 Testig a Claim about a Stadard Deviatio or Variace 39

40 Chi-Square Distributio Test Statistic X 2 = ( - 1) s 2 s 2 s 2 s 2 = sample size = sample variace = populatio variace (give i ull hypothesis) Critical Values ad P-values for Chi-Square Distributio Foud i Table A-4 Degrees of freedom = -1 Based o cumulative areas from the RIGHT 41 Table A-4: Critical values are foud by determiig the area to the RIGHT of the critical value. 0.025 0.975 0.025 df = 80 a = 0.05 a/2 = 0.025 57.153 106.629 42

43 Example: Aircraft altimeters have measurig errors with a stadard deviatio of 43.7 ft. With ew productio equipmet, 81 altimeters ers measure errors with a stadard deviatio of 52.3 ft. Use the 0.0505 sigificace level to test the claim that the ew altimeters have a stadard deviatio differet from the old value of 43.7 ft. Claim: s 43.7 H 0 : s = 43.7 H 1 : s 43.7 0.025 a = 0.05 0.975 a/2 = 0.025 0.025 = 81 df = 80 Table A-4 57.153 106.629 x 2 = ( -1)s 2 (81-1) (52.3) = 2» 114.586 s 2 43.7 2 Reject H 0 57.153 106.629 x 2 = 114.586 44 Example: Aircraft altimeters have measurig errors with a stadard deviatio of 43.7 ft. With ew productio equipmet, 81 altimeters ers measure errors with a stadard deviatio of 52.3 ft. Use the 0.0505 sigificace level to test the claim that the ew altimeters have a stadard deviatio differet from the old value of 43.7 ft. SUPPORT REJECT Claim: s 43.7 H 0 : s = 43.7 H 1 : s 43.7 The ew productio method appears to be worse tha the old method. The data supports that there is more variatio i the error readigs tha before. 45

46 Table 8-3 Hypothesis Tests Parameter Coditios Distributio ad Test Statistic Critical ad P-values Proportio p =5 ad q =5 Normal: z p = ˆ p p q Table A-2 Mea σ ot kow ad ormally distributed or =30 Studet t: t = X s µ X Table A-3 Stadard Deviatio or Variace Populatio ormally distributed Chi-Square: x 2 ( 1) = 2 σ 2 s Table A-4 47